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ABSTRACT 

This paper constitutes S Summary of a seminar entitled ‘'CcmmenI I ng 
Proofs' 1 given at the Artificial Intelligence Laboratory during the 
Spring of 1974. The writ is concerned with new syntactic. Structures 
In formal proofs which derive from their pragmatic and semantic as pec 
It Isa Synthesis of elements from Yessenin-Vo!p1n 1 5 foundati Ona I 
Studies and developments in Artificial Intelligence concerned with 
commenting programs and the use of this idea in automatic debugging 
procedures* 
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1. Introduction. 


This paper constitutes a summary of a seminar entitled 
"Commenti ng Proofs" given at the A. I, lab at M.l.T. during the 
Spring of 1974. The work is concerned with new syntactic structures 
in formal proofs which derive from their pragmatic and semantic 
aspects. It is a synthesis of elements from Yassenin-Volpin's 
foundational studies fe.g. "U1trafattritionIsm and the Anti traditional 
Prograin for the Foundation of Mathematics"-, Proceedings of the 
Sumer Conference on Intuition15m and Proof Theory at Buffalo, New 
York* 1968) and developments in Artificial Intelligence concerned 
with commenting programs and the use of this idea in automatic 
debugging procedures fe.g, Gerald Sussmanl's doctoral Thesis: 

"A Computational Node I of Skill Agufsftion". M.I.T.,1973J. 

For the most part we shall restrict Ourselves to the context 
of Peano Arithmetic and the primitive recursive arithmetic of' addition, 
multiplication, and exponentiation in particular. At the end -a few 
remarks will be made on how these ideas are to be extended to a 
richer deductive environment. 

In our work we shall introduce means whereby it becomes possible 
to distinguish between formal proofs {a sequence of sentences satis¬ 
fying the usual syntactical criteria) and "real 1 ' proofs fa formal 
proof constructed by someone with the goal of proving the theorem). 

In the latter case each line may be commented by intrinsic (formal- 
syntactic) and extr insic fgoal related) remarks. These remarks not 


only explain the purpose of a particular line, but at the Same time 




e&tabllih connections with other lines [previous lines and ones yet to be 
constructed)-'in fact, these remarks point to connections between the very 
signs that make up the lines of the proof. These connections {which shall 
be called identlf leattonaT connections nr Ids) are the links of a very detailed 
syntactic structure which resides implicitly In real proofs> a structure 
of "causa V chains connecting the occurences of Symbols. 


With this sort of information it becomes possible to answer a question 
like: "What part of the term 11 111 is responsible for the third stroke 
In the right hand side of the equation 11*111 = 11 nil?". Such a part 
will be called an ingredient of a tern, consisting of a certain form 
of list structure whose atoms are the occurences of symbols in the term. 

Much of the present work Is concerned with characterising the dependence 
of ingredients on the computational paths used to evaluate a term. In the 
larger program of which the present work Is a part, proofs are seen as the 
codification of procedures for manipulating list structures in such a 
way as to induce mappings from ingredients to ingredients. This provides 
a framework for understanding various traditional phenomena such as con¬ 
sistency and independence. On a higher level It suggests new ways to 
fomallie such notions as relevant entailment and methods of proof. One 
last point before getting Into the details. An equivalence relation is 
defined on ingredients so that every ingredient is equivalent to an ingredient, 
in "normal“ form. This form appears to be connected to a notion of computa¬ 
tionally efficient computation paths. 

2. Recursive Arithmetic. 

Recursive Arithmetic consists syntactically of terms and equations, and 


deductively of computations fcontructed by means of the recursion axioms 
and the substitution rule). 


Terms are expressions formed from 1, + » J ,exp,{,) by means of the 
following rules. 

1) 1 is a term. 

£) If t is a term then so is tt, 

3) If t and S are terms then so are (t + s)^ t-s, and e*p(t,g). 

We shall use the symbols tpr,5,.,ti*rj,5^ ,... to denote terms. We 
shall make use of the usual conventions for ignoring parenthesis. Also 
note that our use of the word "term" doesn’t allow for the occurences of 
free variables, i.e. they are always to be closed- 1 ) 

Terms of the form 1, 11, ..., l^ n ^ fn concatenated strokes) are called 

numerals . 

An equation is an expression of the form t •= s. 


The Recursion Axiom. Schemata: 

Addition A1 (t + 1) * tl. 

A2 (t * si] * {t + S)1 
Multiplication Ml t-1 ■ t. 

m t - (si ] ■ t-s + t. 


Exponentiation El exp(tj) a t. 

E2 exp{t,sl) H exp(t,s)-t. 

1} The notational methods that we shall develop for +,■, and exp, will 
serve to handle all other primitive recursive functions. 







The Substitution Rule Schema; 

Line LA t ■ s(r) 

Line LB r m r 1 

Line LC t“s(r'). 

Here s(r) denotes a tern In which the term r has one or more Indicated 
occurences, and s(r^) denotes the term resulting from the replacement in 
s(r) of r by r 1 at the indicated occurences of r in s{rj. Note that this 
is more general than uni fern substitution. We say that LC follows from 
LA and LB by substitution, 

A proof or computa t ion is formally defined as a Sequence of equations 
each Of which is an instance of a recursion axiom or else follows from two 
previous equations by substitution. We consider two simple yet illuminating 
examples. 


Exampl e 
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3- Commenting Proofs. 

We begin cementing these computations by means of intrinsic and 
extrinsic remarks [this terminology is after Sussman), Then we use those 
remarks to generate ids between the occurences of symbols in the computa¬ 
tion, By tracing out chains of ids we can establish an accountability for 
every sign in the computation. This will at the sane time make explicit 
the semantics of the computation, i.e, which occurences of symbols are 
synonymous and what are the computational roles of each sign. 

The 1htrinslt (or formal ] consents make note of- 

1) if the line is an axiom,in which case a pointer is generated to the 
axiom schema in question; 

2) if the Tine follows by substitution, in which case pointers are generated 
to the lines from which it follows and to the occurences of the term to be 
replaced. 

The extrinsic (or rela ted) comment? consist oft 

1) the top level goal statement (i-e f to find the value of term t); 

2) the assertion that lino L £s * s') is generated In order to simplify 
by substitution a term t{s) in a previous line L'; 

3) the assertion that line L" is the result of a substitution rule from 
lines L, L 1 whose purpose It was to achieve simplification by means of this 
substitution; 

4} the assertion that the line catches the top level goal. 

Example 2,1 Conmented . 

The top level goal is to evaluate 1 + IK 

Ll 1 + 11 * (T + 1)1 (Axiom A£) (Purpose is to simplify 1 + 11 of t.T.g.) 

L2 1+1 £ 11 (Axiom Al) (Purpose is to simplify t +■ 1 of r,h,s, 

of Ll using substitution.) 









L3 1 + 11 - 111 


(Sub, U»L2) {Purpose is to fulfill the goal 


of L2. L3 matches t.Kg.} 


Example 2.2 is commented in a similar manner. 


4. Identifications! Connections, 

He now add a third type of comment to the analyzed computation, namely 
we make note of the identificational connections (Ids), First of all each 
axiom is to be accompanied by certain ids as. follows, 

A1 t + 1 = tl 


A2 t + si * \t + 5)1 


mi t r 


M 2 t" (sl ) * t-s + t 



£1 exp(t s l] B t 



12 exp(t + sl) - expct.5 ) L t 


For example, in P*2 we would say that t and s in the r.h.s. are re¬ 
written from; the t and s In the lh,s. and this justifies their synonymity. 
On the other hand we also mate an identtficational connection between the 
two occurences of 1 and this constitutes our semantical interpretation 
of + in terms of the sucessor function. 


Itow consider M2, He say that both t's in the r.h.s, are rewritten from 






the kh,s. t. Similarly the s in the r.h.s. is rewritten from the s in 
the l.ih.s. However we associate the second t in the r,h,S- with the stroke 
] of the l.h.s,—this Is part of our semantical interpretation of - . Thus 
we are looking at ft A as saying add m to itself n times; in this computation 
n acts as a counter for the n different rewritings Of m„ The strokes of n 
are -control elements in this case, 


Let A he an axiom and id{A> the set of Ids associated with A. More ex- 
plicily; if a stroke p on the r.h.s is simply rewritten from a stroke q on 
the l,hrS, (without a control element) then put 1d(p,q) in id(A); if, on 
the other hand p in the r.h.s. is rewritten from, q in the l.h.s, under the 
control element q‘ and using an axiom for the operation f (either ■ or exp) 
then pot tdtp.^q.q') in id(A).^ 


Ids come from* the substitution rule in accordance with the following 
schema, 

LA t = r(s) 


LB 


1 / 






LC t = r(s') ♦ 


(These ids actually come from the extrinsic 
comments associated with these lines: LB'S 
purpose is to simplify the indicated s in 
r.h.s. of LA. Hence the s in the l.h-S- 
of LB Is rewritten from the 5 In the r.h.s. 
of LA. LC achieves the goal of LB; hence 
the context r in the r,h,s, of LC is re¬ 
written from the context r in the r.h.s, of LA and the s* in the r.h.s, of LC 
is rewritten from the s’ of the r.h.s, of LB- Also, the l.h.s. of LC is rewritten 
from the l.h.s of LA,) 


1) A set of ids like id(A) Is usually taken symetrically,i.e., if id(u.v) 
is in id(A) then so is id(v t u). 






Fully commented hy ids Examples 2,1 and 1,1 look like the following. 



L3 l + 11 * 111 

Example 1.1 [Commented by ids,} 


li iik'^1 1 + 1 




can now trace out paths of ids, thereby diagramming the computational 
relations between the different occurences of strokes in the proof. For 
example, in Example 2*1, if wo denote the occurences of strokes in L3 by 
p,p' ,p lc ,q, q\q dl {left to right} then we can see that p- is connected to q, 
p 1 is connected to q' and p" is connected to q'\ Furthermore these are the 
only connections between the strokes of L3, This yields a very nice correspon¬ 
dence between the l,h.s, and the r.h.s. strokes of L3. 


The case of Example 1.1 is more complicated in that some of the occur- 









rences of strokes act as counter elements and when In this role do not 
get rewritten- To present a complete analysts Of Ids In Example 2.2 we 
shall lable all occurences of strqJces and to the right of each line we shall 
list the id associated with 1t< Between the lines we shall put the interline 


ids conning from the extrinsic comments, 

LI 1 * 11 -1-1 +1 td{a £t t>2), fd{{a t *a p *) ,b 3 ) } 

a - ! a £ a 3 b-| b £ b^ 

{1dfb r c ; ) p 1d(t> z *e 2 }J 

L 2 1 * 1 =1 iid{{c v c 2 ,- ),d T } 1 

C 1 di 

|id[a T - ,ei) for 1 = 1,2,3* 1 d(d] ,fj}„ Idfb^fg)] 

L3 1 ■ 1 1 =1+1 [idUve^O^h 1d((e-| *e 3 .- 

e 1 e 2 e 3 f l f 2 
ild(f la g 1 )Jdff ? *g 2 }] 


L4 1 +■ 1 =11 {1dfa 1 ,ll|), Td{g 2a h2) 1 

9] h-jh^ 

|id(ef jP| J for i = 1*2*3, 1 d{N-j.q| >* id(h z .q£)J 

L5 1 ■ 1 1 ' 1 1 [fd{(|>i tPg,* > k qi), id((p 1 *p 3 *'}*q 2 )] 

P| P £ P 3 

The ids in square brackets are derived, from other ids. For example, 
1d((e],e 2 , r ),fj) Of 13 is derived from: 

fd({c T .G 2 *-M 1 ), idfb^tj), idfbjjCj), id(a|,b^). id(a £> b £ )* 
1d(ipe 1 } 1 idfag.ej) and td(d^fj). 


The rules to derive ids will be formulated in section 6, 


Observe that it is possible to trace out a path from the stroke q ] or 
of L5 to a pattern of strokes p-]*p£,and of L&> this is exactly the 
contents of the square brackets acd^mpanyirng L5. Thus we are lead to say 
that qj is rewritten from p-j under the control of p 3 * (p]*p 2> ’J and 


(p^pPgj"} are the patterns in T*T1 of LS ‘"responsible" for q-j and re¬ 
spectively in this computation. These patterns we call the ingredients 
of 1 - 11, Generally speaking when a term t Is evaluated (i.e. proved equal 
to a numeral It f J we may identify the Ingredients of t responsible for 
each stroke in jt |. We want to show that the value of t is independent of 
the computation path. We shall also determine the way In which ingredients 
depend on the computation path. 

5, Computation Paths. 

On the previous pages we have presented two examples In some detail 
Inorder to give an intuitive picture of what is happening with proofs * 
comments and ids. We shall now turn to a detailed study of the possible 
computation paths from a term. 

we shall restrict our attention to computations which have the 
(standard} farm: 

Ll t ] ■ 1 1 

L2i-1 = t 1+1 

L2i - £{‘ 

LZi+1 tj - t 1+z 

L2n-3 t T = t n , 

where Ll is an axiom and for i = 1 T ...*n-2, L£1+l follows from L21-1 and 
1.21 by substitution. We shall assume that these computations have been 
commented and the appropriate ids have been made. 


The sequence of terms t (=t-|) 3 tt n is called a computati on path 




from t to t , 
n 


Consider a Step of the computation {called a simple reduction) 


LA t •* f{s) 



Define ID{r{s) — r(s’)) to be the set of derived ids between the signs 
in r(s) and r(s 1 ), Specifically: 

1} if p ts an occurence of a stroke in r(s J ) which is rewritten from q in 


r(s) via id a then id{p,q) is in lD{r(s} -* r(s')]* 

2) If p is an occurence of a stroke in r(s') which is rewritten from q in 

s f of LB via c and q Is rewritten from q' in s of the l,h«s. of LB„(1,e. 

idfq'jq) is in id(s=s')*sae p.7), and q 1 is rewritten from q r| in s fn the 
r.h.s, of LA via t> * then id(p T q") Is in ID frfs) r{s 1 )) . 

3) If p is an occurence of a stroke in r(s H ) and is rewritten from: q in 

s' of LB via C and q is rewritten from q' in s in the l*h.s. of LB under 

the control of q" in the l.h.S. of L& for the funtion f (i-6- 1d(q,[q‘»q" ,f)l 
is in id(s = s r )) s and q 1 and q 1 " are rewritten from And q** respectively 
in s in the r.h.s, of LA via b then id(p, fq°3 > is in ID{r{s) — r(s')). 


Every ocurrence of a stroke p in r[s') of LC is associated through 

ipfrfs)_-r(s'J} with a unique stroke q In r(s} of LA or a unique pattern 

of strokes and q°° in r(s) + 

If t is a computation path P and t n is a numeral then 

P is called an eval uat ion path and t n is called the value of t w.r.t.P , 





Define T{t ) to be the set of all term occuring In computation paths 
from t; the relation s-*-r determines a partial ordering of T(t) which we 
win now investigate,^ 

The notation 5--*r denotes a commutation path from 5 to r, We say 
that tj—*-t n is a su bterm path If non of the reductions t^—^t^ + i involve 
the main function {outer most function symbol in polish notation), 

Defi nit ion 5J , A loop consists of two different computation paths going 
from, a term r to a tern s. A diamond is a loop which has either form I or II 
below, Ve Illustrate this using ■ as the main function. 


Form 

I. 

s j r 
/ ^ 

{where the two indicated paths 



t * 

f i 
* / 

V g 

s' -r* 

are subterm computation paths) 

Fo rm 

II, 

s ‘H 
" 'Si 

{where both broken paths are subterm 


s' r r 1 1 

s-r *■ s 

paths. We may assume that each involves 


N* 



s' 

j r' + s r 

the same associated subterm paths s-+s 1 t r-+r', 


Definition Two computation paths are simple vari ants if they differ by 
a diamond, f,e, they look Tike 
t 

i 


4 



t 

+ , 
t 1 

Two paths P and Q are ho.tnoj^^0us If there is a sequence of computation paths 
Pl,P2,,..jPn such that P - Pi and Q - Pn and for i ■ 1,.., f n-1 s Pi and Pi +1 
are simple variants. Mote that homologous evaluation paths assign the same 
value to a term. 

1) If S = S 1 is an axiom then we shall also wright s-^ws 1 , and call it 
a simple reduction. In this case lD(s—^-s 1 ) is taken as 1d{$ = s’). 











Laima 5,3a. A split of the form 


s 1 -r 


s" 1 r 


or 



can bo resolved Into a diamond, {The dotted paths are subterm 35 aths. Multi¬ 
plication Is just serving as a paradigm case.) 

Ultima 5,3b . Any split tart be resolved into a loop. 

Lefncna 5*3 Is an analogue to the Church-Rosser theorem for the Lamda- 
calculus. The proof Of this and other results is by induction on the rank 
rfc{t) Of a term,where rk Is an integer valued function {prinative re¬ 
cursive) defined so that a) if s Is a subterm of t then rk{s) -c rfc(t),and 
b) tf s—*-t then rk(s)► The existence of such a function shows us 

that any computation path from t has less than or equal to rit{t) steps. 

If we were using all primitive recursive functions instead of just + T j p 
and exp then such a rank function could be general recursive but not primi¬ 
tive recursive (e.g. like Ackerman's function }. The organization of the 
proof is to dovetail 5*3a and 5.36* first proving 5.3a for rjt(t) - n and 
then 5.3b for rk[t) - n. 

T heorem S,4 . Any two evaluation paths for a term t are homologous. 

Thus the value of a term t is independent of the evaluation path; (t| 
will denote the value of t.+ 

If we restricted substitution to uniform substitution then Tft) 
would have a much simpler form., namely splits t would resolve to t 







This would simplify much of our work. However since our eventual goal 
is the study of general proofs such a restriction would have to be 
relaxed. 

6. Ingredients. 

definition 6.1 . Let q-| denote the occurrences of strokes In the 

term t. Ing{t} denotes the set of list expressions obtained from q-| . T q n , 

and -»exp according to the following rules. 

1] q-j,...,q n are in Ing(t). 

2) If i_ and j_ are in Ing{t) and f is ■ nr exp then (j_.j_.fj is In Ing(t). 

The members of Ing(t) are called the abstract i ngred ients of t s and 
qj s ...,q are called the simple ingredients of t; lng 0 (t) is the set of 
simple Ingredients of t. In j_ is called the control element and 

t is called the raw material. 


definition H i Ing(t)—irlngfs) is a homorphism (horn) iff for all 
in Ing(t) p 

,f). 

Fa cts about homomorphis ms: 

d.3a. If H P G : Ingft)—►IngtsJ Are homomorphisms which agree on Ing Q {t) 
then H ■ G. 

6.3b Any map H : Ing 0 (t}—Mng(s) extends to a unique horn from Ing{t} 

into Ing(s), 

6,3c Horn H is 1-1 op Ing[t) Iff H is 1-1 on Ing 0 (t)« 












Me associate with a simple reduction t—the following horn 


Let lOft^s} be the set of ids accompanying t^s as defined on p.n,. 

H[t-»s] is the unique hem from Tng(s) into Ing(t) determined by (for 4 in Ing g {s)) 

! p If td(p s q) is in ID(t-*s), 

. 

(p.p\f} If id( (p * P 1 ,f) ,q} is in ID(t-s). 

H[t-+sJ Is 1-1 on Ing (J {s) and hence is 1-1 on aTl of Ing(s) r 

Definition 6,4 , Let P * be an evaluation path for t; thus ft| = t n - 

Define the hom H[P] - N[t-|-rt 2 ]^[t 2 ^-t 3 ]*"The set of 

( real } ingre dt ents w.r.t, P U the set H[P](Ing 0 (t)J and Is denoted by Ing(t;P), 

Note that H[P] is a 1-1 map from Ing 0 {[t[) into Ing(t). So the cardi¬ 
nality of Ing(t;P] is equal to the integer denoted by the numeral ft I , We 
think of H[P](q) as the unique "computational pattern" in t which Is "re¬ 
sponsible ' 1 for q via P* 


7. "Invariance" of Ingredients. 

How does H[P] - depend on P? Consider the following example. 

E xample 7,1 . ^6^7 

(11 ■ 1 1 1 ) ■ 1 1 

(ii + ii + ii}*ii 1 i *i 1 1 + 


r \ r 2 r 3 r 4 r 5 r 6 


r 7 r 3 




11111111 


S I S 2 S 3 S 4 S 5 
1 1 1 


11-111 

V? S G S 9 S 1 C 


P] Pj P 3 P 4 ^5 Pg Ps Pg PtoPuPiz 
C omputing H[P](p g ) and H[Qj{p g ) we get 

H[P](p 9 ) -1= ((q ls d 4 * r )^ 7 , J )4nd H[Q3(p 9 ) = 1= {{q p v M q 4 >V‘)*-5 ■ 






The has-ft difference between f_ and Is the temporal order in which 
the control ■elements q 4 and q ? are operating and not in the ultimate ■ 
control relationships. 

The relation between 1_ and £ is made Into 4 basic equivalence relation. 
Definition 7.2 . Let i_ and £ he ingredtents In Ing{t). i * i Iff there is 

a sequence of pairs fi-| ^ ).Cl^In) ingredients in Ing(t) {this 

sequence would be called a derivation of > j_J such that i_ - 1^, j_ - j^ p 
and for k = l*... T n , either 

» 4-Jfc- 

2} there exists Ingredients u r ,v*w and function symbols f,g such that 
' l({u f v,f),w f g) f ({u»w»gMViW»g)if) } , 

3} there exists k", k" less than k such, that 
Ik a and ^ 


7.3. Facts about i . 

7-3a) = is an equivalence relation on Ing(t). 

7.3h) If 1= j_and V 5 j. 1 then (i4 + f) s (V *£\t). 

7,3c) If Ijt E ik for k = 1, 2 , 3 then 

((ij ■>Aj«f)" «-J_3 i'S) - (vj_i»jjj *s) ■> (jj j 3 1 j g) »f J. 

Theorem 7.4 , Let H : Ing(s ) —p-Ing(t} be a homomorphism. Then H preserves s t 
i.e,,if 1_ and j_ are in Ing(s) and i i J_ then h(i) i U[j_) + 

Theoren 7,5 , Let G>H ; Xng(s}—wlng(t) be two homomorphisms and suppose that 
for all p in Ing 0 (sJ t G(p} = Hfp), Then GCJ_> » Hfi) for dll i_ in Ing(s), 


A suhterm computation P: s-t—ts’-t 1 can be decomposed in a natural 





way into two computation paths P(s) : s - - 1 and Pft) : t--*t\ If we 
Identify the strokes of s' with the corresponding strokes of s' In s'-t 1 
then we may consider Ingfs'J as a subset of Ing[s 1 *t b hand likewise for t\ 

The hom H[P] may then be decomposed into two horns H[P(s)] and H[P(t}]. 

We formulate this in the next lerma. 

Lama 7,6 , Let P t P(s) and P{t) be as above., then 
H[P] restricted to Irrg£s 1 > equals H[P(s)], and 
H[P] restricted to Ing{t 1 > equals H[P[t)J. 

An ingredient In Ing(t} is called a real ingredient if it is equal 
to Ji[P]fq) where P is some evaluation path from t and q is in lng 0 (ltl) r 

Theorem 7.7 , (Invariance of Ingredients.) Let P and q he two evaluation 
paths for t> Then for all p in Ing Q (t}, H[P]{p} s H[q]{ph 

This is an imiediate consequence of the next leirna. 

Lemma 7,!M et t be any term with rk(t) less than or equal to n + 

K If Pand Q are computation paths from t to t' forming a diamond and 
1 in Ing(t') is rEal then H[P](1J * H[q](i_). 

H. If P and q are computation paths from t to t 1 and i is in Ingft') and is real 
then H[P](i_) « W[Q](1), 

The lemma is proved by induction on n and dovtailing I and II. Lerana 5.3 
plays a major role. 

6. Final Remarks. 

The next step in this investigation extends our analysis to general 






proofs In first order logic and full Peano Arithmetic. This means associating 
Kith logical axioms the appropriate sets Of ids (e.g. consider ) 

and! determining the interline ids associated with the extrinsic conrientlng 
of proofs. These ids will provide the means of constructing list manipulating 
procedures from proofs which will give rise to semantic interpretations for 
these proofs* 

it is also striaghtfaward to apply these ideas to other deductive-ccmipu- 
tationat systems such as the lamda-talculus and Curry’s Combinatorial Logic, 


